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Abstract 

Given experimental evidence at the LHC for physics beyond the standard model, 
how can we determine the nature of the underlying theory? We initiate an approach 
to studying the “inverse map” from the space of LHC signatures to the parameter 
space of theoretical models within the context of low-energy supersymmetry, using 
1808 LHC observables including essentially all those suggested in the literature and 
a 15 dimensional parametrization of the supersymmetric standard model. We show 
that the inverse map of a point in signature space consists of a number of isolated 
islands in parameter space, indicating the existence of “degeneracies”—qualitatively 
different models with the same LHC signatures. The degeneracies have simple physical 
characterizations, largely reflecting discrete ambiguities in electroweak-ino spectrum, 
accompanied by small adjustments for the remaining soft parameters. The number of 
degeneracies falls in the range 1 < d < 100, depending on whether or not sleptons are 
copiously produced in cascade decays. This number is large enough to represent a clear 
challenge but small enough to encourage looking for new observables that can further 
break the degeneracies and determine at the LHC most of the SUSY physics we care 
about. Degeneracies occur because signatures are not independent, and our approach 
allows testing of any new signature for its independence. Our methods can also be 
applied to any other theory of physics beyond the standard model, allowing one to 
study how model footprints differ in signature space and to test ways of distinguishing 
qualitatively different possibilities for new physics at the LHC. 


1 The LHC Inverse Problem 


With the imminent start of the LHC in 2007, particle physics is on the threshold of its most 
exciting period in over thirty years. The secrets of the TeV scale will begin to be revealed, 
and whatever is found is likely to have profound implications for fundamental physics. With 
only two years to go, what are the most pressing remaining open questions in physics beyond 
the standard model? Over the last two decades, many models of weak scale physics have 
been proposed, starting with the early proposals of supersymmetry P and technicolor P, 
through to the more recent ideas of large and warped dimensions Pi, and the little Higgs 
p. The last three years have seen an explosion of differing models—ranging from Higgsless 
models P, composite Higgs in warped compactihcations P and other warped models PP, 
twin Higgs UDI, and SUSY little Higgs models m — largely arising from combining previous 
ideas in a variety of ways. More recently, orthogonal “un-natural” directions have also been 
opened up, inspired by the possibility of an enormous landscape of vacua in string theory, 
beginning with the proposal of split supersymmetry 1121 and other minimal models for dark 
matter and unihcation [13 in. 

However, it is clear that the era of speculation and model-building is near an end—we 
will very soon get a direct hint for what is going on from early LHC data. It is unlikely that 
exploring the {N + 2)nd variation on the {N + l)st mechanism for electroweak symmetry 
breaking is particularly important at this moment in time. Unless some qualitatively new 
ideas and associated signals are involved, one might as well wait and see what nature tells 
us before investing a lot of time on detailed model variations. 

Instead, there is another problem, far more urgent especially as the LHC draws near, 
that has received less attention than model-building. How will we determine the underlying 
new physics from LHC data? Of course the hrst important question is: how will we know 
there is any new physics beyond the standard model? It is possible that the signals will be 
sufficiently spectacular as to immediately tell us there is new physics and narrow the space 
of possibilities to a very small number. For instance, if the LHC hnds evidence for a colored 
particle decaying an observable distance from the beamline, this would look a lot like split 
SUSY or perhaps other models with nearly stable colored particles, but not anything like a 
garden-variety supersymmetric model. 

Even without such dramatic signatures, if there are new colored particles beneath the 
~ 2 TeV scale that are produced and decay to SM particles, we can fairly quickly be con¬ 
vinced of the existence of new physics. But determining the properties of the underlying 
model becomes more challenging, even at a rough level. For instance, can we know if we 
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Figure 1: The Inverse Map from LHC Observables to Theoretical Models. Given observed 
signals for physics beyond the standard model, how can we determine the underlying theo¬ 
retical model? 

have discovered SUSY, or are the signals with trileptons and missing energy due to extra 
dimensions with KK parity ns. or little Higgs theories with T-parity usi? Despite some 
recent studies [i7i[iHi[inii2niinii2ai23i2i, not much systematic work has been done on 
this “inverse” problem 1^. Let’s suppose even that we are working in the context of 
low-energy SUSY with minimal field content. Will we be able to determine even qualitative 
properties of the spectrum? Can we, for instance, tell even roughly whether or not the gaug- 
ino masses are consistent with GUT scale unification? Whether the LSP is a good candidate 
for Dark Matter? 

Instead of addressing this question, most of the work on collider phenomenology to date 
has been done in the “forward” direction, studying the map from parameter or model space 
into the space of observable collider signatures. The signals for a specihc model are studied 
in great detail, with the goal of seeing how well the parameters of the model can be mea¬ 
sured or constrained. Often, many of the signals are tailor-made to the model at hand and 
aren’t effective for other models, particularly not for the general case. To make the studies 
tractable, they are usually performed within simplified models with very few parameters—in 
the context of SUSY, for instance, these have been carried out with mSUGRA, gauge me¬ 
diated and anomaly mediated SUSY breaking j^Zl |2Hl 12^1120112111221 1^ - for several recent 
studies, see jnH 123 123 123 I2H11221 HO] • Presumably, the hope is that if enough models are 
simulated in the forward direction, we will gain familiarity with the associated signals and 
will be able to spot them if they arise at the LHG. 

But the LHG inverse problem—studying the map from LHG signatures to weak scale 
models as in hgure^— is more interesting, important and challenging. At a hadron collider, 
it is difficult to directly measure masses and other properties of new particles, a problem 
exacerbated in SUSY by the escaping LSPs carrying away missing energy. Plus, any signal 
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Figure 2: The Inverse Map in the Best and Worst of All Possible Worlds. Ideally, the 
image of LHC data onto parameter space would specify an unique underlying model. In the 
most pessimistic scenario, LHC data would suggest the physics beyond the standard model 
without giving us any clues as which model describes the new physics. 

is likely to receive contributions from multiple channels, and the signals that are reliably 
observable are more limited. A given set of observations at the LHC select a small region 
point in signature space, whose size is determined by both by intrinsic statistical/quantum 
(“\/]V”) fluctuations as well as by experimental errors. The question is, what does the inverse 
map back to parameter space look like? Even very basic issues are open—for instance, is 
this map one-to-one? 

As shown in £gure|21 in the best of all possible worlds, the inverse map from a small region 
in signature space would pick out a small region in parameter space, so that the underlying 
model would be uniquely picked out, and better measurements of the signals would yield a 
more accurate determination of the model parameters. In the worst of all possible worlds, 
the inverse map into parameter space would £11 out a huge continuously connected region of 
parameter space, giving us no handle on even the basic structure of the underlying theory. 

In this paper, we initiate a systematic approach to the LHC inverse problem in the context 
of low-energy SUSY. As we will see, in SUSY the actnal pictnre is an intermediate one—the 
inverse map consists of a number of isolated islands in parameter space as in hgure El While 
each island is small, there are a number of them corresponding to qnalitatively di£erent 
models. For obvious reasons we call these “degeneracies”—di£erent models with the same 
LHC signatures. We will be able to give a simple physical interpretations of the main kinds 
of degeneracies, which arise largely from discrete ambiguities in the electroweak-ino sector. 

In the next section, we describe our approach to understanding the map from signature 
space to parameter space, and describe the simple statistical techniques that allow us to 
indirectly infer the expected number of models with degenerate LHC signals. In section 
El we then snmmarize onr pictnre of the inverse map, together with an ontline of a simple 
physical characterization of the dominant source of degeneracies. In section (H we present the 
details of onr specihc stndy, inclnding the MSSM parameter space we scanned and the 1808 
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Figure 3: Our Picture of the Inverse Map. In the context of low-energy supersymmetry, 
we hnd that the inverse map of LHC data consists of a number of disconnected and widely 
separated regions in parameter space. This indicates the presence of degeneracies—different 
underlying models that share the same LHC signatures. 

LHC signals we considered. In sections El and IHl we give a more detailed characterization of 
degeneracies and provide specific examples of pairs of degenerate models exemplifying each 
class of degeneracies. Part of the reason for the degeneracies is that, in most of our parameter 
scan, on-shell sleptons are not copiously produced in cascade decay chains, so there is less 
leptonic information. In section El we do a dedicated scan over a parameter space where 
sleptons are typically present in the decay chains, and find that as expected, the number of 
degeneracies is significantly smaller. In sections |H1 and El we describe possible improvements 
and extensions to our work and outline how one should use our study after a real discovery 
at the LHC. We end with a discussion and outlook on future work in this area. 


2 From Parameters to Signatures and Back 

2.1 Parameter and Signature Space 

Before studying the inverse map from signature space to parameter space, we should specify 
what each of these spaces are. Of course, the most pressing question at the LHC will be to 
hgure out whether there is any evidence for physics beyond the standard model, and then 
most broadly what theoretical framework best describes the new physics—for instance is it 
SUSY, or strong dynamics, or something else? As has been increasingly appreciated over 
the last couple of years, even here there are possibilities for degeneracies. In particular, any 
model with a stable particle protected by a parity symmetry has similar looking missing 
energy signals as SUSY, and it is a challenge to decisively distinguish the models. As far as 
we can see, there is no systematic approach to this problem yet. 


4 





But there is also an analog of the inverse problem purely within the context of low-energy 
SUSY. The parameter space of the model is so huge, and gives rise to such a large range of 
possible signals, that we can already ask, even assuming we have low-energy SUSY, whether 
we can even roughly determine the correct region of parameter space from LHC data. In this 
context, the “parameter space” is clear—hxing for instance the minimal held content of the 
supersymmetric standard model, we can vary the ~ 105 soft parameters of the theory. In 
practice, this is much too large a number, and it is useful to consider a smaller subset that still 
captures much of the variety of physics to be expected. Many of the 105 soft parameters are 
relevant only to havor physics and do not have much effect on collider physics. These consist 
of three gaugino masses Mi^ 2 , 3 , the Higgsino mass parameter /r, degenerate soft masses for 
the hrst two generations of Q, L, E’^ helds (in order to avoid large havor violations), 

and separate soft terms for the third generation scalars. If we also include tan /3, the ratio 
of the up- and down-type Higgs vevs, this gives a total of 15 parameters. 

The “signature space” is also easily dehned, as we will do in more detail in .section BTTI 
After imposing the appropriate cuts, we associate a number with every LHC observable. For 
instance, any number count of events of a particular kind is a direction in signature space. 
Any kinematic histogram can be divided up into deciles, and the boundaries of each decile is 
a direction in signature space. In this way, we quickly get a huge dimensionality for signature 
space—the LHC observables in our study span 1808 dimensions. 

Since signature space has a much higher dimensionality than parameter space, naively 
one would think that the inverse map from signature to parameter space would be unique. 
Actually, this is incorrect, largely because the signatures tend to be highly correlated with 
each other, so the effective dimensionality of signature space is much smaller. In fact, we can 
imagine dividing signature space up into bins, with size determined by statistical fluctuations 
and experimental errors. Even scanning over all possible MSSM parameters, the number 
of bins in signature space—the number of experimentally distinguishable outcomes at the 
LHC—is not enormously large. One of our results will be a quantitative measure of this 
number which will help us understand important aspects of the inverse map. 

An obvious strategy for studying the inverse map is to simply simulate the MSSM in all 
regions of its parameter space. For instance, we can imagine taking each soft mass parameter 
in 100 GeV increments between, say, 100 GeV and 1 TeV. Even hxing tan f3 and taking our 
simplihed 14 relevant soft parameters at the LHG, this is a total of ~ 10^^ models. In 
practice it is impossible to simulate so many models—simulating the hrst year of LHG data 
for a single SUSY model with fast detector simulation takes about 1 GPU hour. This is why 
so many studies are performed in models with a much smaller dimension parameter space. 
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Figure 4: The Birthday Problem for the MSSM. We simulate m models and associate each 
model with a bin in signature space. For m iV it is unlikely for any given pair of models 
to share the same LHC signatures, but there is a statistical expectation value W ~ mV(2iV) 
for the total number of pairs that end up in the same bin. 

like the 5 dimensional parameter space of mSUGRA. 


2.2 Statistics and the Birthday Problem 


However, we can get a good idea of what the inverse map looks like even with a large 
parameter space without having to simulate so many models. We do this by simulating m 
models, but then comparing the signatures between all m{m — l)/2 pairs of models. This 
gives us statistical leverage and also allows us to determine gross properties of signature 
space, including its real effective dimensionality as well as the the total number of “bins” or 
experimentally distinguishable models. 


We can do all of this because of the famous “birthday problem” shown in hgure |31 To 
take a simple analogy—suppose we throw balls into a box with N bins; we can’t see inside 
the box so we don’t know how big N is. But the balls are sticky, so that if two balls land 
in the same bin, they stick together. We can throw in as many balls as we want, and then 
empty the box. One might think that to determine N, one has to throw in m = balls to 
cover all possible bins, and then the pigeonhole principle would guarantee seeing a pair of 
balls stuck together when m = N + but this is not the case. If we assume that the balls 


fall into the bins randomly, on average the number of bins with p balls Np is^ 






p\NP 


-1 


( 1 ) 


In particular, the number of doubles is W = m^/{2N). So, when the balls are dumped 

^This definition of Np is still approximately true even for large m and small N if we say that a bin with 
q balls makes a contribution of q choose p to the expectation value of Np. 
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Figure 5: The (AS*^, AP^) plot in the best of all possible worlds. The expected conhdence 
range on model parameters is dehned by the maximum value of AP^ at the AS^ correspond¬ 
ing to statistical fluctuations. 

out of the box, we can see how many doubles N 2 there are, and this allows us to determine 
N as N = m?/{ 2 N 2 ). Furthermore, if we saw some triples and quadruples too, we could test 
our hypothesis by seeing whether the value of N we extract from all of them are consistent 
with each other. Clearly, for this pair-wise counting strategy to be effective, we have to 
have m ~ a/A to have at least a few doubles, but this is a big improvement over the naive 
expectation that we need m ~ A. 

We follow this strategy in our study. We simulate m models and associate each with 
a point in LHC signature space. Any pair of models has an associated distance AP^ in 
parameter space and A5^ in signature space^. We can look at a 2D plot of the points 
(AS*^, AP^) for all m(m —1)/2 pairs of models. In the best possible situation, this plot would 
look like a narrow triangle, as in hgure El The minimal AS‘^ corresponding to a/A error 
would correspond to a small AP^ in parameter space and larger AF^ would be associated 
with larger AP^. 

2.3 Degeneracies 

The real situation with the MSSM is rather different—the plot for the models we simulate 
is shown in hgures El and 0 We will specify our parameters and signatures in more detail 
in section 0 For the present it suffices to say that m = 43026 MSSMs were simulated. 
The vertical dashed line in hgure 0 marks AF^ = 0.285, corresponding to the distance in 
signature space when the same model is repeatedly simulated, giving us a measure of the 
size of statistical huctuations. 

^See section 01 for precise definitions of AP^ and AS"^. 
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Figure 6: A cartoon of (AS*^, AP^) plot in the general MSSM. At the AS‘^ value corre¬ 
sponding to statistical fluctuations, the maximum value of AP^ exceeds the target accuracy 
on model parameters. While it is possible to change the target accuracy value to formally 
decrease the number of degeneracies, for any reasonable choice (such as 10% or 20% er¬ 
ror), there is still a sizable number of degenerate pairs. Notice that target accuracies of the 
parameters usually correspond to a much larger region in the parameter space than that is 
occupied any small island. This corresponds to the existence of many “cliffs”, see section |231 
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Figure 7: The (AS*^, AP^) plot for the MSSM. In the left plot, we measure AP^ using 
the 14 mass parameter we scan, where AP^ ~ 0.01 roughly corresponds to 10% accuracy 
in all parameters. The position of the vertical dashed line corresponds to the distance in 
signature space from statistical fluctuations, so the fact that there is a sizable number of 
model pairs to the left of this line with large AP^ indicates the presence of degeneracies. The 
right plot shows the same models with AP^ measured only using the gaugino and higgsino 
mass parameters. As we will explain in sectional the break in the plot between small and 
large AP^ comes from a bimodal behavior in the electroweak-ino sector, which explains the 
dominant reason why there are degeneracies in the MSSM. Note that only a small number 
of pairs are shown on these plots; the AF^ values extend out to AS^ > 600. 
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The most striking feature of this plot is the clear presence of a relatively large number of 
degenerate models—there are many points with small but large AP^. (As we explain in 
section lOl AP'^ 0.01 corresponds to parameters matching to 10% accuracy.) The 43026 
models represent a very sparse sampling of the 14 dimensional parameter space, and yet 283 
pairs of models share indistinguishable signatures! 


From our “birthday problem” picture, this is clear evidence that the number of effective 
“bins” in signature space is not enormously large. The expected number of signature bins is 


Nsig ~ 


m 


3.3 X 10^ 


( 2 ) 


As described in appendix ^ we can conhrm that this picture is correct by extracting a value 
of N also by looking at the number of triplets and quadruplets of models with the same 
signatures. The value of N extracted in this way is indeed of the same order of magnitude 
as above. The £t can be further improved by taking into account that some fraction of the 
bins are more likely to be populated than others. 


Our estimate for gives us a good idea of the number of distinguishable bins in signa¬ 
ture space—that is, the number of possible distinguishable MSSMs in the parameter space we 
scanned. But the actual number of degeneracies we care about is criterion dependent. For 
instance, if two models are degenerate with mass parameters differing by, say, only 5% from 
one another, then we may not wish to count these models as being “really different”. We can 
dehne what we mean by “good pairs” which are sufficiently close in parameter space that we 
would expect them to be close in signature space. Given such a criterion, the expectation 
value for the number of degeneracies from hgure |H1 is 

number of pairs close in signature space 

degeneracies) =---—-—-^-. (3) 

number of good pairs 


For instance, for our set of 283 LHC indistinguishable pairs, we can decide to declare 
two models “the same” if they have the same ordering of the ino spectrum. In this case, 
the number of degeneracies is (d) = 4.4—that is, for a given model, there are about 4 other 
models with the same LHC signatures but a different ino ordering. Or we can decide that 
two models with ino masses within 10 percent of each other are “the same”. In this case, 
(d) = 12.9, so we expect for any given model, there are about 12 others with the same LHC 
signatures but with ino masses differing by more than 10 percent. There are only 2 out of the 
283 pairs whose ino and squark masses match to 10%, so using that criteria we would expect 
(d) r'U 140. Infigureini we show how the number of degeneracies changes as we adjust the 
fractional acceptance in parameter space between pairs of models, showing that the source 
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Figure 8: A cartoon showing how to count the expectation value for the number of de¬ 
generacies and cliffs. Region G corresponds to “good pairs” of models that are close both 
in parameter space and signature space. Region D correspond to pairs of models that are 
degenerate, i.e. close in signature space but well separated in parameter space. Region C 
corresponds to “cliffs”, where the distance between models in signature space is large despite 
the models’ proximity in parameter space. 


of degeneracies is not coming only from large error bars on individual parameters but also 
from discrete choices in the SUSY spectrum. 


So we see that the number of degeneracies is G(10 — 100) with the criteria we have 
specihed. We can get a rough idea of whether this number is in the right ballpark by 
estimating the number of degeneracies in a different way. We have determined that there 
are Wig ~ 3 x 10® possibly distinguishable MSSMs. We can also estimate the number of 
“different” models Ymodeis we have in our parameter space based on the parameter ranges 
given in section Assuming that LHC signatures are sensitive to gluino mass variations of 
50 GeV, squark mass variations of 75 GeV, and electroweak-ino mass variations of 100 GeV, 
then^ 

Yn^odels ~ 8 X 5® X 93 ~ 10«. (4) 

Since Ymodeis > Wig, by the pigeonhole principle there must be degeneracies. Furthermore, 
a rough estimate for the number should be 


(degeneracies) 


^models 

iW 

-''sig 


~ 30 


(5) 


which is indeed consistent with our hrst estimate using equation ©. 

•^These values are chosen by estimating the local variation in mass parameters from the (AS”^, AP^) plots 
in section 0 
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Figure 9: Number of degeneracies as a function of allowed fractional error. Triangles corre¬ 
spond to imposing fractional acceptance on gaugino, higgsino, and squark parameters. Dots 
correspond to just imposing fractional acceptance on the inos. The fact that the number of 
degeneracies asymptotes to 2 indicates that the degeneracies are not caused simply by the 
presence of large error bars on model parameters but also because of discrete choices in the 
spectrum. 

2.4 Cliffs 


Another important feature of the {AS'^,AP^) plot is the population of points along the 
horizontal axis, which shows that there are models with small distance in parameter space but 
large differences in signature space. These indicate the existence of “cliffs” in model space— 
small parameter changes can give rise to large changes in the signatures. In particular, this 
suggests that in any local region of parameter space, the map to LHC signature is essentially 
one-to-one. 

We can easily quantify this notion. Given any criterion for whether two models share the 
same “parameter bin”, we can find the total number of pairs of models which are close in 
parameter space and compare with the number which are close in both signature space and 
parameter space. This defines an expected value of cliffs, i.e. the chance that two models 
within the same parameter bin end up in the same signature bin. 

number of pairs close in parameter space 


(cliffs) = 


( 6 ) 


number of “good” pairs 
We can look at a plot of the number of cliffs as a function of changing the signature cut that 
defines “good” pairs. Depending on the criteria for closeness in parameter space, we see that 
the number of cliffs is C1(10^)-C1(10^), showing that a single bin in parameter space maps to 
many different bins in signature space. This confirms the picture in figureElthat the isolated 
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Figure 10: Number of cliffs as a function of signature cut. (AF)^ = 0.285 corresponds to sta¬ 
tistical fluctuations. Triangles correspond to imposing 10% percent acceptance on gaugino, 
higgsino, and squark parameters. Dots correspond to imposing 10% percent acceptance on 
just the inos. The large number of cliffs indicate the strong sensitivity of LHC observables 
in local regions of parameter space. 

regions in parameter space that map to the same signature bin are indeed “small”. 


2.5 Effective Dimensionality of Signature Space 


Formally, our signature space is very high dimensional. Of course, we are mapping a 15 
dimensional parameter space onto signature space, so we do not expect the signature space 
manifold spanned by SUSY models to be more than 15 dimensional. However, the existence 
of degeneracies strongly suggests that there are huge correlations between the signatures, 
such that the actual dimensionality of the space populated by SUSY models is far smaller. 


We can quantify this simply by looking at how the number of degenerate model pairs 
N 2 depends on the distance in signature space AS; if signature space is effectively Dgig 
dimensional we expect 

N 2 ~ (7) 


Now, the precise dehnition of AS requires some care. If we take the dehnition from section 
14.31 and £t to N 2 , we find in figure HD that 


No 


(A^)9'2 near (A^)^ = 0.3 
(AS')^^'^ near (AS')^ = 0.6 


( 8 ) 


suggesting that the dimensionality of signature space in the vicinity of degenerate pairs is 9 
to 14 dimensional. 
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Figure 11: Dimensionality of signature space. (AS')^ = 0.285 corresponds to statistical 
fluctuations, and the maximum value of {AS^ between pairs of models is AS'^ > 600, well 
off these plots. On the left plot, we £t to the power law N 2 ~ (AS')^"'*^ with exponents 
Dsig ~ 9 and 14. On the right plot, we take into account the possibility of an offset value 
ASo, yielding D^ig ~ 5 or 6 (both curves are shown overlapping). In either case, we see that 
the effective dimensionality of signature space is much smaller than the dimensionality of 
parameter space, giving another justihcation for degeneracies. 


Still, the fact that N 2 does not seem to follow a simple power law in AS indicates that 
AS may not the best measure of distances in signature space. In particular, even simulating 
the same model repeatedly gives some hnite value of AS, so one might guess that for a 
signature space of dimension Dgig: 

N 2 ~ (A^ - ASo)^^'^^ or N 2 ~ i^AS^^'A^y^'". (9) 

In figure [TTJ we see that this hypothesis is valid over a wide range of signature values, 

simultaneously giving good fits to^ 

N 2 ~ (^^(A^)2 - (0.42)2^’^ and N 2 ~ (A^ - 0.39)^'^ . (10) 

These hts suggest that the true dimensionality of signature space is Dsig ~ 5 or 6, less 

than the number of MSSM parameters that we are varying. This makes it clear that in 
order to further break degeneracies, it is not enough to add a few extra signatures; one 
must add signals that are sufficiently orthogonal to the existing ones to increase the effective 
dimensionality of signature space. 

Ideally, we could hgure out what the 5 or 6 independent signatures are in a statistical sense 
by doing a multivariate regression on our LHC signatures. Unfortunately, it is computational 

^The number A5'o = 0.42 or 0.39 is interesting, because it is around the minimum A5' between identical 
models run with different random number seeds. 
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difficult to do such a regression on a 1808 dimensional signature space. Moreover, we expect 
at least some of the independent signature directions to change depending on the specihcs 
of the MSSM spectrum. 


3 Our Picture of the Inverse Map 

The existence of degeneracies and cliffs substantiates the rough picture of the inverse map we 
suggested in hgure El consisting of a number of small islands spread out over a large region in 
parameter space. The small size of the islands is reflected in the existence of cliffs, indicating 
that LHC observables are indeed sensitive to small parameter changes. The existence of 
many islands far apart in parameter space is substantiated by the presence of degeneracies. 

Note that the hgure assumes that the islands are “point-like” in parameter space. We 
actually don’t know that this is the case—they may be higher-dimensional manifold like tubes 
or sheets in parameter space. Indeed, if there are obvious flat directions in parameter space 
where for instance a particle is sufficiently decoupled as to play little role in the signatures, 
these would show up as higher-dimensional objects in the inverse map. All we can say with 
conhdence from our analysis of cliffs is that the islands are not space-hlling and constitute a 
small part of the total volume of the inverse map from a given signature. 

The degeneracies have simple physical interpretations as shown in hgure El As is com¬ 
mon in hadron collider phenomenology, the cleanest handle on new physics often comes 
from looking at leptons. In SUSY models where sleptons are not copiously produced in a 
long SUSY cascade decay chain, the leptons dominantly come from Ws and Zs produced in 
electroweak-ino cascade decays. In this case, we hnd large ambiguities in the spectrum of the 
remaining electroweak superpartners. Two models can have identical LHC signals by having 
“dippers” where electroweak-ino mass eigenvalues are hxed but with different eigenstates, 
“sliders” where the electroweak-ino spectrum is moved up or down keeping mass differences 
hxed, and “squeezers” where the information of some of the electroweak-inos is hidden be¬ 
cause the mass splittings are small enough that the leptons in the decay products are too 
soft to be seen. We will discuss these degeneracy classes further in section El 

Of course, the changes in the electroweak-ino sector are accompanied by suitable changes 
in the colored superparticle spectrum to match rates and other kinematical distributions 
between the models. The number of degeneracies for a given model arising in this way is 
of order 10 to 100. As we show in section El when sleptons are forced to be present in 
the cascade decays, there are more leptons in hnal states and the degeneracies virtually 
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Figure 12: Dominant Degeneracies: Flippers, Sliders, and Squeezers. When scanning over 
the general MSSM, sleptons are generically decoupled, therefore important leptonic handles 
on the electroweak-ino spectrum are lost. This allows for three major classes of degeneracies. 
In a “flipper”, the identity of an electroweak-ino is changed while the mass eigenstates 
are fixed. In a “slider”, the entire spectrum is shifted while fixing mass differences. In a 
“squeezer”, a low-energy eigenstate is hidden because the decay products between adjacent 
mass eigenstates are soft. In the above cartoons, “5” and “kF” can stand for a bino, a wino, 
or a higgsino. 


disappear, although there may still be an ambiguity in swapping the left-and right-handed 
sparticle spectrum. But in the general case, our estimate of the number of degeneracies 
is as interesting as it could have been—the number is not one {i.e. there certainly are 
degeneracies) but nor is it 10®. Therefore while the existence of degeneracies represents a 
challenge, it is one that can likely be overcome by devising clever new observables to eliminate 
them. This would allow us to determine essentially all important aspects of SUSY physics 
with LHC data. 


4 Details of Our Study 

Even though the birthday problem has reduced the number of models to simulate from 
m ~ Y to m ~ y/N, the number N of experimentally distinguishable possibilities at the 
LHC is still quite large, and there a number of compromises one must make when comparing 
a large number of different models. 

The first compromise is on the amount of data we can generate. For gluino masses around 
600 GeV, there are around 10^ SUSY events at an integrated luminosity of 10 fb“^. Even 
without including the effect of initial state radiation and multiple interactions, it takes a fast 
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CPU over one hour to simulate that many events, and a simplihed version of the event record 
is roughly 50 megabytes in size. Most previous collider studies assume that the LHC will 
achieve an integrated luminosity of 300 fb“^, but it is simply impractical for us to generate, 
store, and analyze that much data for a large number of models. Therefore, we only generate 
10 fb“^ of data for each model and force all colored particles to be heavier that 600 GeV. 
While the constraint of heavy colored particles was necessitated by computational limita¬ 
tions, there are independent reasons for making this choice. The characteristic signature 
of SUSY at the LHC is hard jets plus missing transverse energy, so while the cross section 
for SUSY increases as the gluinos and squarks get lighter, the jets from SUSY cascades get 
softer, making it more difficult to identify a pure sample of SUSY events. 

The second compromise is on standard model background. While it is certainly possible 
to estimate the effect of standard model background on 10 fb“^ of data, the focus of this paper 
is not on separating SUSY signals from standard model background but on distinguishing 
between different SUSY models. At 300 fb“^, one can make hard cuts and still maintain 
decent statistical signihcance of leptonic signatures, but this is difficult at 10 fb“^. Therefore, 
we ignore standard model background in this study except as a guide for dehning reasonable 
cuts and triggers. 

This is simultaneously an optimistic and pessimistic choice, because by ignoring standard 
model background we are inflating the statistical signihcance of our small data sample, but 
by ignoring the effect of higher luminosity we lose access to rare processes that may give 
important clues in deciphering the data. We emphasize that both of these compromises are 
not intrinsic limitations, but are dictated by our current computing resources. 

4.1 LHC Observables 

Broadly dehned, LHC data is anything that can be measured with an ATLAS- or CMS-like 
detector with delivered luminosity. To simulate 10 fb“^ of LHC data, we use PYTHIA |1T] 
to generate parton level interactions and hadron showering and pipe PYTHIA output to a 
modihed version of the CDF fast detector simulator PCS written by John Conway jlSj. This 
modihed version was developed by Steve Mrenna and approximates an ATLAS- or CMS-like 
detector. PCS yields reasonable efficiencies and fake rates and includes the effect of energy 
smearing. 

We use a simplihed output from PCS for our study, namely a list of objects in each event 
labeled by their identity (photon, electron, muon, hadronic tau, jet, b-tagged jet, missing 
Ej) and their four-vector. Leptonic objects are also labeled by their charges. Using this 
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information, one can construct almost any LHC signature imaginable. Of course, when real 
data from the LHC arrives, various different techniques will be used to isolate, verify, and 
make measurements on samples. Because we are ignoring standard model background in our 
analysis, our goal is to choose a set of observables that are sensitive to MSSM parameters but 
which are sufficiently inclusive to be useful over a wide range of parameter values. Note that 
we make no attempt to interpret any of our signatures in terms of cross sections, branching 
ratios, or mass differences in the underlying model; instead, we simply compare raw signature 
values between different models. 

While we are not including standard model background, initial state radiation, or multiple 
interactions in our analysis, in appendix El we select cuts and triggers in a way that is aware 
of the challenges they pose. We will focus on events with 2 or more jets plus large missing 
transverse momenta, for while jet-veto signatures from direct production of electroweak-inos 
can sometimes be important, the standard model background is generically too large for the 
parameter region we scan. 

The complete list of the signatures we use is given in appendix El There are two different 
types of signatures we consider, counting signatures and kinematic histograms. Counting 
signatures give the number of events that pass a certain set of criteria. Because getting an 
accurate measurement of <Jsusy is very difficult at the LHC, we only include one signature 
that counts the total number of SUSY events that pass the above cuts; all other counting 
signatures are given as ratios. The two types of kinematic histograms we generate are 
effective mass^ and invariant mass for various different objects in events: 




( 11 ) 


a 


We use a quantile method to dehne signatures for all of the kinematical distributions. 
The entries in a distribution are organized into bins of variable width such that each bin 
contain the same number of entries. For example, a decile distribution has ten bins which 
each contain 10% of the total entries. The signatures for a distribution are given by the 
boundaries of the bins, with no signature stored for the upper or lower boundaries of the 
total distribution. Therefore a distribution split into deciles has 9 signatures associated with 
it. Note that using the quantile method, the information of a distribution is in the positions 
of the bins, rather than the content of each bin. 

Our adoption of the quantile method is dictated by the necessity of dehning distribution 
observables which are applicable to a wide variety of models with very different mass spectra. 

^In other contexts, “effective mass” means the Pt sum over missing energy and the four hardest jets. In 
this paper, we use effective mass to mean any Pt sum over any number of objects. 
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The traditional histogram method of fixed bin size is not practical here since we would have 
to include histograms with large energy ranges and with many different bins. Because 
different parts of the histogram would be populated by different models, we would either be 
forced to store a large number of redundant signatures or make the bin size so large that 
important kinematic information could be lost. The quantile method resolves this problem 
by converting distributions into a small number of easy-to-use inclusive observables. 

In the SUSY literature, there is a large focus on endpoints and edges in kinematic distri¬ 
butions as ways of constraining mass differences between different SUSY particles |221E11E3 
OH. While it is difficult to teach a computer how to generically find an edge/endpoint (and 
assign appropriate error bars), the quantile method of describing histograms captures most of 
the statistically significant information in a kinematic distribution. So even though there is 
not a separate signature corresponding to an edge/endpoint measurement, the edge/endpoint 
will be well constrained by the requirement that all of the quantiles for the histogram match. 


4.2 The Scanned Parameter Space 


LHC signatures are mainly sensitive to mass parameters for the particles which either have 
large production cross sections or are important links in cascade decays. Considering both the 
strong constraint from FCNC measurements and the lack of sensitivity in LHC observables 
included in the current study, we have imposed universality conditions on the masses of the 
first two generations of scalar fermions. On the other hand, third generation mass parameters 
are treated as independent. Flavor off-diagonal entries in the sfermion mass matrices are 
assumed to vanish. Left- and right-handed sfermions are allowed to have independent mass 
parameters. 

We scan tan/3 and the following 14 SUSY mass parameters: 


Inos : Ml, M 2 , M 3 , /x 

Squarks : 

Sleptons : 


mi 


R 


We have separated left- and right-handed sfermions. The soft masses for the first two gener¬ 
ations are universal. For right-handed squarks, the up- and down-types are varied indepen¬ 
dently. In order to constrain the Higgs sector, we take the mass of the pseudoscalar Higgs 
to be 850 GeV. We also fix the third generation squark H-terms at 800 GeV. 

We scan the parameter space by randomly sampling the 15 dimensional parameter space. 
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Figure 13: Parameter Ranges for our General Scan. The gluino and the six squark masses 
are randomly sampled between 600 GeV and 1 TeV. The three electroweak-ino and the four 
slepton masses are randomly sampled between 100 GeV and 1 TeV. The value of tan/? is 
scanned between 2 and 50. 

We sample all mass parameters uniformly in mass value, and scan tan (3 uniformly from 2 to 
50. The range of mass parameters in our general scan is shown in figure ITT^ 

Note that we do not enforce the mass of the Higgs to be above the experimental bound. 
The reason is that in order to get a heavy enough Higgs in the MSSM, we would be forced 
into a small region of parameter space with large tan (3 and large stop masses. Because the 
LHG signatures of SUSY are dominated by colored particles and are largely independent of 
the Higgs sector, we do not want to limit the kinds of allowed SUSY signatures by placing 
artificial restrictions on the SUSY parameters, especially because the Higgs mass may be 
lifted due to an extended Higgs sector. 

We also impose a non-decoupling criterion to minimize the number of flat directions in 
the map from parameter space to signature space. Let be the heaviest slepton soft 

mass, rri^ be the heaviest electroweak-ino mass parameter (Mi, M 2 , or p), and let 
be the soft mass or mass parameter for the heaviest color-charged particle. We demand: 

™“p,on < + 50 GeV < + 100 GeV. (12) 

The purpose of the non-decoupling criterion was to ensure that apart from the 50 GeV 
buffer, there was always a kinematically allowed cascade decay involving sleptons in the 
sample. However, this non-decoupling criterion in no way guarantees that sleptons will play 
a dominant role in the decay chain. As we discuss more in sections El and [3 scanning over 
the MSSM, the total production rate for sleptons from cascade decays is generically small. 
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4.3 Comparing Models 


The most statistically sound method for testing whether a set of LHC observables matches 
a given model is to generate an “infinite” statistics sample of that model, and do a test 
to estimate the likelihood that the LHC data matches that model. Because we are focusing 
on testing whether there are SUSY models that share the same signatures with 10 fb“^ of 
data, we only need to find a measure of the difference between data sets, and then establish 
a threshold below which models are considered degenerate. 


We define a y^-like variable to measure the difference between two models 

= (13) 

'tsig j \ / 

where sf (sf) is the value of the Uth signature for model A (B), is the error bar assigned 
between models A and B for the i-th signature, and the sum over i runs over Usig relevant 
signatures. A relevant signature is one that would not artificially reduce ASab because of 
low statistics. We define a relevant signature to be one for which the error bar is smaller 
than both sf and sf or for which | — sf | > . The value of is given by 


CTi 


\ 


(^statS^) + (5statS 


B 


f + 


fi 



(14) 


where the statistical errors (fgtat are described in AppendixEl and fi is an additional fractional 
error parameter that could be used to estimate standard model background errors. For our 
study we take fi = .01 for every signature except the total number of SUSY events, for which 
we take / = .15. 


In order to figure out the value of (AS'ab)^ that defines the typical size of statistical 
fluctuations, we ran a subset of our models again with a different random number seed and 
calculated the AS"^ values between duplicated models. We define the cutoff as the 95th 
percentile of these AS'^ values, yielding (AS'gsth)^ = 0.285. 

One concern in using a y^-like variable to distinguishing models is that it does not account 
for the fact that if one signature differs by a large (> 5a) amount, the overall can be still 
be very small. This is especially a concern in our case where the total number of signatures— 
1808—is very large, and we discuss this concern in more detail in section IHl One could try 
using the condition that models are considered the same only if every signature is within 5a, 
however, using this criteria would force us to reject identical models generated with different 
random number seeds where one or two signatures differ by more than 5 ct because of a large 
statistical fluctuation. Another deficiency of a y^-like variable is that it does not account 
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for the fact that certain signatures are better than others at distinguishing between models. 
However, in trying to develop an optimally weighted {ASab)"^ variable, we have found no 
effective weighting strategy which will enhance the statistical signihcance of the difference 
between degenerate models. 


In order to quantify the distance between two models in parameter space, we dehne AP^ 


as: 


{APab? = 


n 


E 


''A R ' 

Pf - Pf 


para ^ 


p\ 


AB 


AB ^ pf + pf 

Pi o 


(15) 


where pf {pf) is the value of the i-th. parameter for model A {B) and the sum runs over 
^para parameters. Roughly speaking, AP^ gives the quadrature average of the percentage 
difference between model parameters. Depending on the context, this sum can run over all 
the parameters or just a subset. 


5 Characterizing the Degeneracies 

The pigeonhole principle argument tells us that there must be degeneracies, but it does 
not tell us what the degeneracies actually are. And it is clearly challenging to hnd an 
algorithm to systematically hnd all degeneracies associated with a given model, precisely 
because they are not “close” to each other and there is no obvious way to continuously 
travel between degenerate pairs. Plus, the existence of cliffs shows that it would be difficult 
to hnd local minima in signature distance because of the strong sensitivity of LHC observables 
to the SUSY parameters. Nonetheless, as we will see the degeneracies have rather simple 
characterizations, and understanding these will help in devising strategies for breaking the 
degeneracies with more signatures. 

The hrst important point is that the gross properties of the colored superpartner spectrum— 
the masses of the gluinos and squarks—are largely determined by our signals. In hgure El 
we examine the (AS*^, AP^) plot where AP^ only includes the gluino mass separation. This 
perfectly resembles the “best of all possible worlds” plot from hgure 0— it is beautifully tri¬ 
angular and there are no degeneracies for gluino masses. The analogous plots for squarks are 
also shown in hgure El Note that there is greater variation in AP^ for small AS"^, indicating 
that LHC observables are not as sensitive to individual squark masses as gluino masses. One 
of the reasons why the variations in squark masses are small is that squarks are scanned in 
a small range compared to sleptons and electroweak-inos. However, the mass of the lightest 
squark is still well constrained, suggesting that the jet signatures hx some overall scale for 
the squarks but not their havor or handedness. 
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Figure 14: A (AF^, AP^) plot for gluinos and squarks, where again AP^ ~ 0.01 roughly 
corresponds to 10% accuracy in the desired parameters. We see that the maximum variation 
in gluino and squark masses is roughly 10% and 40%, respectively. While not shown in these 
plots, the variation in the mass of the lightest squark (regardless of identity) never exceeds 
~ 15%. In the plot for squarks, the variation of the mass parameters is calculated as the 
average variation of individual squark mass parameters. 

The plot for slepton masses in hsTure ITHl is very different. Note that there is a continuous 
spread of slepton masses at small AF^. Evidently we are not particularly sensitive to the 
slepton mass over large ranges of parameter space; this is unsurprising because sleptons are 
not copiously produced in cascade decays in most of the regions of parameter space we have 
simulated. Similarly, tan (3 is generically unconstrained, because while the higgsino couplings 
are controlled by tan f3, movement of the third generation squarks can often compensate for 
changes in the branching ratio to higgsinos. 

The situation is different still with the electroweak-inos, as can be seen from hgure 
Here we see dramatic evidence for the existence of degeneracies—at small AS*^, the plot has 
two branches with small and large AP^. This is strong evidence for discrete ambiguities in 
the determination of electroweak-ino parameters. 

The existence of degeneracies in the electroweak-ino sector may seem counter-intuitive. 
After all, the electroweak-ino sector in the MSSM is made of particles from different repre¬ 
sentations of SU{2)l X P(l)y. Therefore, they couple very differently to matter multiplets 
and to standard model gauge bosons. For example, the bino will couple to both left-handed 
and right-handed states, while the wino only has left-handed couplings. The wino and the 
higgsino have both charged and neutral states—approximately degenerate in mass—as part 
of the same multiplet, while the bino only has a neutral state. 

Therefore, one might expect that LHC signature should be very sensitive to not only the 
masses, but also the identities of the electroweak-inos. Indeed, two otherwise identical mod- 
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Figure 15: A AP^) plot for sleptons and tan/?. We see that none of these parameters 

are particularly well constrained. The variation of the slepton mass parameters is the average 
variation of individual slepton masses. 
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Figure 16: A (AS*^, AP^) plot for electroweak-inos. In these plots we see the presence of 
two regions at small AS*^. There is a lower triangular region, in which the electroweak- 
inos are constrained to at most 20% - 40% variation. Then there is an upper region where 
the electroweak-ino masses appear unconstrained. This indicates the presence of discrete 
choices in the electroweak-ino spectrum. Note that the mass of the LSP mass is much better 
constrained than the mass of the individual electroweak-inos, showing that LHC signatures 
are sensitive to mass eigenvalues but not mass eigenvectors. 
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els with different electroweak-ino mass parameters produce quite different LHC signatures. 
However, our key observation is that such changes of the electroweak-ino mass parameters 
can sometimes be compensated by changes of other soft parameters. 

In general, we can attribute the existence of degeneracies to the following facts. First, 
in principle, the identity of final state quarks from SUSY decay chains carries a lot of the 
information about various intermediate superpartner states. On the other hand, except for 
the partial flavor tag of the third generation, all other information of the quantum numbers 
of final state quarks are lost. Second, most of the kinematical observables, such as Pt, are 
only sensitive to the mass splitting of the superpartners, not their identities. Third, there 
are large region of parameter space where the electroweak-inos have nearly identical decay 
modes, hence very similar signatures. For example, as long as the mass splitting between 
the electroweak-inos is greater than niw and sleptons are decoupled, the dominant decay 
mode of a chargino is almost always + LSP, relatively insensitive to the mixings 

of two chargino states. Finally, we do expect to get a better handle on the identity of the 
electroweak-inos if the decays through on-shell sleptons have significant branching ratios, 
due to the fact that leptonic signatures typically carry much more information comparing to 
jet signatures. On the other hand, without any theoretical preference, this is a very special 
corner of the MSSM parameter space. 

Notice that overall event rate, while the most statistically significant observable for mea¬ 
suring mass scales of colored particles, is not a sensitive observable to the more subtle 
structure of degeneracies involving electroweak-inos and sleptons. Nor is it sensitive to in¬ 
dividual movements of squark mass parameters, as long as some rough overall squark mass 
scale is fixed. 

The electroweak-ino degeneracies can be characterized in a number of simple ways, which 
we describe in the following subsections. 

5.1 Flippers 

Flippers are probably the most dramatic example of degeneracies in the electroweak-ino 
sector. In this case, although the mass eigenvalues of two sets of electroweak-inos remain 
approximately the same, the identities of two members of them are swapped. We observe 
that it is possible to make such a swap without significantly changing the signatures by 
adjusting somewhat the other SUSY mass parameters at the same time, such as left- and 
right-handed sfermions masses. 
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Flipper 



Figure 17: An example of a Flipper, where the masses of the electroweak-inos stay roughly 
hxed but their identity changes. In this example a wino LSP is replaced by a higgsino LSP. 


Cycler 



Figure 18: An extreme example of a flipper is a Cycler, where the electrweak-ino identities 
undergo a cyclic permutation. In model A, B < W < fx, whereas in model B, W < jx < B. 
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A simple example of a flipper is shown in fiEfnre IT71 In this pair of models, althongh the 
LSPs have approximately the same mass, their identities are wino and higgsino, respectively. 
Besides the LSP, other electroweak-inos are at most barely in the decay chain, becanse of 
the suppressions either from phase space or from an off-shell squark. The dominant channel 
of decay of gluino and squarks are directly to jets and LSP. Since we do not measure the 
charge of the jet, such signals reveal very little about the identity of the LSP. Notice also that 
the masses of the other squarks and sleptons moved in this example, compensating possible 
differences from a simple swap of the wino and higgsino. 

Another more dramatic example of a flipper—a “cycler”—is shown in hgure In 
this case, the identities of the three electroweak-inos in the degenerate models differ by a 
cyclic permutation. In each of these two models, there are two electroweak-ino states lighter 
than the gluino and squarks, and hence both are present in the decay chain. On the other 
hand, in the absence of a signihcant slepton branching ratio, the decay between the two 
electroweak-ino states is dominated by hPs, Zs and Higgses, and Higgs information is always 
less signihcant due to backgrounds, trigger bias, and tagging efficiency. Therefore, again, the 
lower stage of decay chain, which is the most relevant for understanding the electroweak-inos, 
are not very telling in this case. We will discuss this example in more detail in section IHl 

5.2 Sliders 

Due to the existence of a neutral massive particle escaping the detector, most of the LHC 
observables for SUSY are only sensitive to mass differences. Therefore, we would expect 
that the signatures would not change signihcantly if we shifted the mass spectrum while 
holding mass differences hxed. Because we also use the total rate as one of the observables, 
we expect that we will not be able to change the parameters a lot this way. The slide in the 
spectrum will have to be somewhat uneven in different parameters as well. 

An example of such a slider is shown in hgure The dominant production channel 
for these two models are gluinos. We see that the masses of a number of states change 
collectively in one direction, keeping the most important mass gaps, such as between gluino 
and the LSP, approximately hxed. 

5.3 Squeezers 

If the mass separation between the two states in the decay chain is very close, the decay pro¬ 
cesses between them will only generate very soft objects which are below detection threshold. 
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Slider 



Figure 19: An example of a Slider, where the spectrum is shifted up, keeping the mass 
difference among the electroweak-inos and gluinos roughly hxed. In this case, the shift is 
about 100 GeV for the most relevant parameters. 


Squeezer 



Figure 20: An example of a Squeezer, where the mass difference between electroweakinos 
is made small such that the decay products between adjacent states are too soft to detect. 
In this example, the LSP mass is hxed, but the number of neutralino states at low energies 
changes from 2 in model A to 1 in model B. 
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In this case, we expect to have little ability of telling them apart as different states. 

One example of such a squeezer is shown in hgureEB In model A, a wino state is almost 
degenerate with a bino state, while in model S, only a wino state light with about the same 
mass. One would expect to be able to tell the difference between these two models since the 
lower lying states have different ratios of neutral and charge states. On the other hand, it 
turns out, due to the structure of other soft parameters in these two models, the dominant 
decay to the light states only contain jets. Therefore, because of the charge blindness of the 
jets, we lose information about the charge of the hnal states. 

6 “Are There Really Degeneracies?” 

Strictly speaking, we base our analysis and counting of degeneracies on a global y^-like 
variable. It is well-known that doing a Hf with a large number of observables—1808 in 
our case—will be sometimes misleading. In particular, it is in principle possible that two 
models differ by a large amount in a few observables, hence are distinguishable, but can 
still have a small AS^ difference and be mistakenly identihed as a degenerate pair. This is 
certainly a valid concern, but we argue that this effect is not very important in our counting 
and characterization of degeneracies. 

First of all, the limitations of a fit are only apparent when there are a small number of 
observables that are dramatically different. The sort of situation that can arise is that two 
models agree on everything except one signature which is different by, say, lOcr. However, 
this is not likely to happen in our study—from the analysis of section we find that most 
of our observables are correlated and the actual number of independent signature is small. 
More specifically, if we assume there are ~ 10 independent observables in the set used in 
this study, then if we compare two models and one signature is off by 5a, then we would 
generically expect 10% of all the observables to also show signihcant deviations, making it all 
but impossible for that pair of models to have a small enough to be called degenerate. 

We can also examine the degenerate examples we have found and check for such an effect. 
We can compare, for instance, the maximum deviation in the signatures with dileptons, with 
the same deviation from simulating the identical model with different random number seed. 
Dilepton signatures are chosen here since they have very little standard model background 
after cuts and a significant difference in them will definitely break the degeneracy. 

In particular, we ask what fraction of the degenerate candidates in our database have at 
least one dilepton signature deviation greater than 5a, and whether that fraction is consistent 


with statistical fluctuations. Out of the 283 pairs in the general scan within the strict criteria 
of AS"^ < 0.285, 46 had some dilepton signal that differed by more than 5a, yielding a failure 
rate of 16%. But out of 2600 models run with different random number seeds, 611 had some 
dilepton signal greater than 5 ct, yielding a statistical expectation for a 23% failure rate. 
Therefore, we see that the failure rate is consistent with statistics, showing that generically 
our definition of AS"^ identifies true degeneracies. 

We could do such a comparison in other classes of signatures as well, but such a compar¬ 
ison will be less meaningful for several reasons. Signatures with fewer leptons typically have 
much larger standard model background. Therefore, the statistical errors we have used in 
our study are not realistic. In particular, we expect only large and qualitative differences in 
these signatures to be useful in distinguishing SUSY models. At the same time, compared 
with dilepton signatures, it is much easier to make small changes in the soft parameters to 
obtain a better fit to zero or one lepton signatures. For example, a large deviation in the 
total number of SUSY events can be compensated by a very small change in the gluino mass 
with negligible effects on the other signatures. 

To give a sense of how different the LHC observables are in a typical degenerate pair 
of models, we can look more closely at a specific example which has interesting swaps in 
electroweak-ino masses—the “cycler” example shown in section 15.11 There is no significant 
difference in the dilepton signatures for the cycler pair. Therefore, it is not at all obvious how 
we should distinguish them. This is also a pair of models where we do see some discrepancies 
in some of the other signatures, which are however unlikely to be useful to qualitatively 
distinguish the two models. 

A set of typical observables for this pair are plotted in figure Columns labeled by A 
and B are from the degenerate pair of models. Column C shows distributions from running 
model A again with a different random number seed, which provides a guide for the size of 
statistical fluctuations. Visually, we see that there is no difference in the counting of 6-jets 
or in the Mes distribution, despite the fact that those distributions showed some variations 
between models A and B (1 ct-3(t). Except for the signatures we discuss below, all other 
counting signatures and distributions showed comparable or smaller deviations. 

The shapes of the dielectron invariant mass distributions are somewhat different. On the 
other hand, the difference between models A and model B are at least as big as the difference 
between model A and model C, showing that at 10 fb“^, there is not much information in 
these dilepton invariant mass distributions. Whether there could be some smoking gun in 
this distribution at 300 fb“^ is unknown. 
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Figure 21: Example distributions from the “cycler” example of figure fTT^ Model A and B 
are a degenerate pair, while model C has the same parameters as model A but was generated 
with a different random number seed. Note that histograms are plotted in quantiles—equally 
occupied bins of variable width. With the exception of lepton charge and total number of 
events as discussed in the text, all counting signatures have roughly the same variation 
among the three models as the 6-jet counting example shown here. Similarly, all kinematic 
histograms have the same or smaller variation as the Mefj example. While the dielectron 
invariant mass distribution looks different between models A and B, it also looks different 
between models A and C, so we can ascribe such differences to statistical fluctuations. The 
main difference between models A and B is seen in the rate of Higgs production from the bb 
invariant mass distribution, though as we argue in the text, such a difference is washed out 
by ti standard model background and also depends on the details of the Higgs sector, which 
we made no effort to control in our study. 
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One potential difference in these two models is the bb invariant mass distribution. In 
both models, Higgs bosons are produced as part of the decay chain. In model A, Higgses are 
more copiously produced, and there is a 4a difference in the rate of Higgs production. On 
the other hand, it is very unlikely such a signature will be useful, even at high luminosity. 
First of all, at least with our cuts, such a deviation will not be visible once we include the 
standard model ti background. Moreover, such a difference is highly sensitive to the details 
of Higgs sector. If there is a signihcant modihcation of Higgs sector of the MSSM, such a 
including a singlet in the possible decay product of the Higgs, this deviation will almost 
certainly be most less prominent. 

The largest deviation in the observables between these two models is a somewhat large 
charge asymmetry in the single lepton signature, about 20%, corresponding to 7a-8a using 
only statistical error bars. However, distinguishing these two models based on this difference 
in single lepton signature will be very challenging due to the existence of large standard 
model background. At higher luminosity, one might expect the charge asymmetry to appear 
in trilepton events, but given the absence of sleptons in the decay chain, trilepton signatures 
will be relatively sparse. 

Furthermore, the charge asymmetry is not necessarily a robust qualitative difference 
between these two scenarios either. The dominant contribution to such a difference comes 
from squark-gluino associated production, which is in turn very sensitive to small differences 
in squark masses. Because the overall rate of SUSY events between models A and B differ by 
4a due to a 50 GeV shift in the gluino mass, we could hne-tune the gluino mass in one of the 
models to make the overall SUSY production rate more comparable, and such a hne-tuning 
(accompanied by other shifts in the squark sector) could alleviate the charge asymmetry 
without drastically modifying other observables. 

The cycler example is chosen here not because the pair has completely identical sig¬ 
natures, but because it exemplihes an interesting structure of degeneracy. The fact that 
two such drastically different models could come this close with observables extracted from 
only pure signal gives us the conhdence that they are good representatives of a degenerate 
scenario. 

Because of the ambiguity of the analysis, there is no reason to believe that the best 
examples of degeneracies should be within the signature cut of = 0.285. There are 
pairs with somewhat larger signature separation and yet with no qualitative difference in 
any particular signature. The apparent difference in total comes from adding up small 
deviations from a number of observables, which is expected due to the large number of 
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Farther Apart in Signature Space 



Figure 22: An example of a pair of models whose signature distance {AS^ = .457) is larger 
than the cutoff dehned in this paper for degenerate models. While these models can be 
distinguished by our fif; if is possible that by slightly changing the parameters in this 
model, a robust degeneracy could be formed. This pair qualihes as a flipper, in that the 
identity of the second electroweak-ino is changed from a bino in model A to a higgino in 
model B. 

observables. Such pairs may actually be a better representative of a qualitative degeneracy, 
and we expect that nearby models in parameter space would form degenerate pairs satisfying 
a stricter degeneracy criteria. 

Such an example is shown in hEfures 1^ and EHl This pair of models is a flipper degeneracy 
where the second-lightest electoweak-ino changes from a bino to a higgino. While there are 
many la variations in numerous signatures, there is no qualitative difference between these 
two models in any of the inclusive signatures. The most noticeable difference is an 8% 
difference in the inclusive one lepton counting, whose effect is somewhat visible on the plot 
in hgure |2S1 There is also a 9% difference in the counting of 2 jet with no lepton events. 
Such deviations will almost certainly be swamped by large standard model background. 
There is no charge asymmetry in one lepton events, nor evidence for different rates of Higgs 
or Z production, nor any other potentially qualitatively different structure in all the other 
signatures. Such an example suggests that in the vicinity of pairs of models that lie a 
moderate signature distance apart, one could arrange robust degenerate pairs by making 
small parameter variations to smooth out la differences. 
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Figure 23: Example distributions from the pair of models in figure [221 Like figure ITTI model 
C has the same parameters as model A but was generated with a different random number 
seed. The most noticeable difference in these plots is a slight imbalance in the number of 
3 jet vs. 4 jet events with 1 lepton present. All of the other distributions are qualitatively 
similar between models A, B and (F; a typical variation is shown for a Mgfj plot. While a 
slight bump in the dielectron invariant mass distribution is present in model B at 300 GeV, 
the significance of such a feature is called into question by the presence of a similar bump in 
model C at 500 GeV which is entirely absent from model A, despite the fact that they share 
the same underlying parameters. 
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Figure 24: The parameter ranges for our dedicated cascade decay scan. Unlike the previous 
scan, we assume flavor universality for the sleptons and squarks, but we will still split left- 
and right-handed sfermions, and up- and down-type squarks. We force that the heaviest 
slepton is lighter than at least one electroweak-ino. As before, tan/? ranges from 2 to 50. 

7 Sleptons and Long Cascade Decay Chains 

Part of some people’s optimism about making precision measurements at the LHC comes 
from studying leptonic signatures. Not only is the standard model background smaller for 
events with hard jets and missing energy and hard leptons, but leptons, with their charge 
and flavor identihed, carry much substantial information about the underlying processes. 
Moreover, the energy resolution on electrons and muons is much better than for jets. So 
while colored particles in SUSY guarantee that SUSY cross sections will be large at the LHC, 
it is leptons from cascade decays that are entrusted with constraining the SUSY spectrum. 

In constrained models like mSUGRA or GMSB, we have reason to believe that slepton will 
be light, and therefore a reason to expect electroweak-ino-slepton-electroweak-ino cascade 
decays. Notice that the parameter region where we have a large slepton production in the 
decay chain is quite special. It certainly requires electroweak-inos, in particular those with 
large gaugino fractions, to be heavier than the first two generations of sleptons. At the same 
time, it also requires those inos to be light enough so that they are in the decay chain from 
gluinos and squarks. This set of requirements enforce a well-ordered mass spectrum from a 
general MSSM point of view. 

In order to explore this special situation better, we have conducted a dedicated scan, as 
shown in figure 1^ in which we enforce the existence of some long decay chains. In particular, 
the heaviest slepton is required to be lighter than at least one of the electroweak-inos. Also, 
we do not split the third generation of squarks and sleptons from the first two generations, 
with the idea that in most theoretical models with long cascade decays, there is usually some 
kind of flavor universality. 
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Inos, Squarks, and Sleptons Just Inos 
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Figure 25: A (AS*^, AP^) plot analogous to figure [71 where we see no evidence for degenera¬ 
cies. The between identical models is now AS"^ = 0.258. The left plot indicates that 
mass parameters can shift by at most 20%. The right plot shows no evidence for bimodal 
behavior in the electroweak-ino sector. 

We simulated m = 27196 models, and still found a number of identical doubles, W = 56. 
This gives us an estimate for the total number of bins in signature space 

Nsig ~ m^/N2 ~ 6.6 X 10® (16) 

which is somewhat larger than in our previous example, despite the smaller range of param¬ 
eter space scanned. This clearly indicates that new leptonic directions in signature space are 
being opened up and we should expect fewer degeneracies. 

Indeed, of the 56 degeneracies we found, none correspond to reasonably “different” mod¬ 
els, which by equation yields 

(d) ~ 1. (17) 

The pigeonhole principle argument also gives us an estimate of the number of degeneracies in 
the cascade decay with on-shell sleptons. Suppose we could have the following accuracies on 
the measurement of soft parameters: gluinos to 50 GeV, both left- and right-handed squarks 
to 75 GeV, both left- and right-handed sleptons to 75 GeV, and electroweak-inos to 50 GeV. 
This gives us® 

A^modeis ~ 2 X 43 X 6' X 10^ ~ 5 X 10® ~ Wig, (18) 

So by equation ®, we should be left with some 0{1) number degeneracies in this case. 

The reduction of the number of the degeneracies in this scenario can also be be seen 

in the {AS‘^,AP‘^) plots shown in hgure |2S1 More quantitatively, the number of expected 

®Again, these values are chosen by estimating the local variation in mass parameters from (AS'^, AP^) 
plots for our cascade run. However, the separate plots for gluinos, squarks, sleptons, and electroweak-inos 
are not shown in this paper. 


35 






Degeneracies vs. Parameter Acceptance 
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Figure 26: Number of degeneracies as a function of allowed fractional error in cascades run. 
Triangles correspond to imposing the fractional acceptance on gaugino, higgsino, and squark 
parameters. Dots correspond to just imposing fractional acceptance on the inos. We see 
that the number of degeneracies asymptotes to 1, indicating that the only “degeneracies” we 
have observed are coming from uncertainties in mass measurements and not from discrete 
choices in the spectrum. 

degeneracies as a function of fractional differences in the parameters is shown in hgnrel^ As 
we discuss more below, the reduction in the number of degeneracies is mainly due to the fact 
that with long decay chains, it is in general much harder to hnd flippers in the electroweak- 
ino sector. Notice also that the number of degeneracies including squarks in hgure EEl is 
also much smaller compared with our general scan. This is partly due to the fact that we 
have chosen the three generations of soft masses to be universal, but equally important, our 
better handle on the identities of the electroweak-inos from sleptons translates into a better 
handle on the left-right splitting of the squarks. 

The main reason that the presence of the sleptons signihcantly reduces the number of 
degeneracies is their role in decays of the type shown in hgure EZl Without sleptons in 
the decay chain, the electroweak-ino decays are typically dominated by W/Zs —the left 
panel of hgure |23— and Higgses. Because h ^ bb typically carries less information due to 
backgrounds and tagging efficiencies, the only robust handle on electroweak-ino decays comes 
from leptonically decaying Ws and Zs. However, this handle is of limited value, because we 
not only lose statistics because of the small leptonic branching ratio of WjZ, but we also lose 
information about the electroweak-ino identity because the leptons are not directly coupled 
to them. 

On the other hand, the situation is signihcantly better with (on-shell) sleptons in the 
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Figure 27: The two major processes that can lead to correlated same-flavor opposite-sign 
dileptons in SUSY cascade decays. When dileptons come mostly from on-shell Z bosons as in 
the left diagram, then crucial information about the masses and identities of the neutralinos 
is lost. Only when sleptons are copiously produced in cascade decays as in the right diagram, 
do we gain a strong handle on the identities and mass splittings of neutralino spectrum. 

decay chain. The leptonic branching ratios as well as the charge of the leptons are sensitive 
to the identities of the electroweak-ino states in decay processes of the type shown in the right 
panel of figure EH In addition, slepton edges ISni El E3 EZI give a strong constraint on the 
mass difference between electroweak-inos, further limiting the possibilities for degeneracies. 

Still, we expect that in some cases it should be possible to flip the identities of the 
electroweak-inos in combination with flips of the other sfermion states. One example of such 
a flipper is shown in figure EHl though strictly speaking this pair of models does not satisfy 
the degeneracy criteria of section 14.dl In this example, the left- and right-handed squark 
states flip in order to accommodate a flip of the bino and wino. On the other hand, this is 
not a perfect example, since the sleptons are still somewhat decoupled except through small 
mixings between the higgsinos and the gauginos. 

If there do exist left-right flippers involving sleptons, we expect them to be subtle since 
they have to reproduce a large amount information contained in the leptonic signatures. 
In the case that non-degenerate left- and right-handed sleptons are produced in cascade 
decays, it would be extremely difficult to flip the wino and bino around the slepton states; 
a scenario with heavier bino will have two edges in the dilepton invariant mass spectrum, 
while a scenerio with a heavier wino will only have one edge. 

On the other hand, there could be other scenarios in which we could have a flipper. Two 
of those possibilities are shown in figure EHl It is possible that sleptons in the decay chain 
have near universal masses, as shown in the left panel. It is also possible the left- and right- 
handed sleptons are playing different roles in the bino-wino flip case, as shown in the right 
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Left/Right Swap 



Figure 28: An example of a Left/Right Flipper. Though this example does not fall within 
the {ASy cut that dehnes typical statistical fluctuations, the models are sufficiently close in 
signature space that slight adjustments in the colored sector might make this a real degen¬ 
eracy. In this example, the left- and right-handed sfermions switch places to accommodate 
a bino-wino flip. 

Cascade Left/Right Swap? 




Model A (GeV) Model A (GeV) 


Figure 29: Possible examples of Left/Right Flippers involving copious slepton production. 
In order to avoid a double slepton edge from bino decays, either the sleptons must be nearly 
degenerate, or only one handedness of slepton can be involved in the decay. 
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panel. In both of these cases, a double edge jl2l Hll would not exist, but it may be the 
case that these models could be distinguished by a subtle difference between dilepton and 
trilepton events. Verifying the existence of such degeneracies is a challenging task. 


8 Future Directions 

As mentioned in section 0 we have made two main compromises in our study of the inverse 
map by not including standard model background and by only simulating 10 fb“^ of data. 
In order to fully realistically assess the potential of measuring SUSY parameters at LHC, 
these two issues must be addressed. 

Various signihcant components of standard model backgrounds to SUSY signatures— 
such as di-boson, ti, and W/Z+]ets —have been studied for supersymmetry searches |2Z1I2H]- 
The presence of standard model background will no doubt worsen the sensitivity of LHC 
observables, in particular pure jet signals, to soft parameters. Therefore, it is crucial to 
study the extent to which it will change the number or structure of the degeneracies. 

Similarly, the LHC is going to achieve a much larger luminosity than we have simulated. 
With a higher luminosity of ~ 300 fb“^, we could in principle introduce more stringent cuts to 
purify the signal without losing statistical signihcance. Therefore, compared with our study, 
a full LHC data set with high luminosity will signihcantly reduce statistical fluctuations, 
and a fully realistic characterization of the inverse map will have to take this into account as 
well. In addition, a number of processes that can only be observed with larger rates might 
be valuable in removing degeneracies. 

The list of observables in appendix El are by no means exhaustive, nor are the selection 
cuts completely optimized. Our study shows that in order to break degeneracies at the 
LHC it is very important to construct new independent observables, and it may be possible 
to optimize the event selection criteria, or use multiple selection criteria, to increase the 
effective dimensionality of signature space. Furthermore, observables such as — 2, dark 
matter detection and density, and perhaps Bs —> depend on on the same parameters 

in different ways, and might be important in removing degeneracies. They could simply be 
added as additional signatures for each model. 

One of the major reasons for the existence of degeneracies is the loss of information due 
to the hadronization and the formation of jets. One possible direction that deserves much 
more careful study is how much, information about the initial parton we can get out of a 
jet in addition to its four-momentum, by including the information such as track charge 
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weighted by energy |13], or better observables for the same physics. A measurement of jet 
charge with any signihcant conhdence would be very effective in removing degeneracies if it 
were possible. On the other hand, due to the large statistics of the jet signatures, even some 
small preference at the percentage level could help in principle. 

Supersymmetry events usually have a large number of jets from different decay chains, 
as well as from different parts of the same decay chains. Therefore, even with a pure signal 
sample, important structures in the jet kinematical distributions are usually swamped by 
combinatorial backgrounds. It will be necessary to develop more sophisticated methods of 
pairing up the correct combinations of jets if we are to exact useful information about decay 
kinematics. 

Because leptonic signatures are so informative, it would be helpful if we could get as 
many leptonic events as possible. It is possible to enhance the leptonic signature by using 
smarter cuts in special cases. For example, in certain scenarios, leptons could tend to be 
very hard while the jets are softer. In this case, a better selection criteria would focus on 
events with hard leptons instead of imposing such strong cuts on the jets. Careful study of 
using different cuts for each of two or more leptons could be useful. 

Since we expect the sizes of degenerate islands to be small, and because the dimensionality 
of the parameter space is large, it is very difficult to scan densely and simulate a sizable 
sample of data at each step. One possible method for identifying degeneracies is to carry 
out a leading order scan by matching a limited set of important rate x branching ratios to 
narrow down potential regions. Such a scan will not require event simulation and hence is 
much less time/resource consuming. After such an initial scan, we could then start to scan 
and simulate in a much narrower region in order to search for and identify degeneracies. 

A particularly interesting example of a degeneracy study would be based on the mSUGRA 
model SPSla jlH]- This model has an on-shell slepton in the decay chain and hence strong 
multi-lepton signals. Based on our study, we should have less freedom in such a scenario 
to move the soft parameters around—especially the slepton and electroweak-ino masses— 
without producing large difference in observables. On the other hand, even in this case, we 
still expect to see more delicate flippers which combine the flipping of the bino and wino 
with shifts in the sfermion masses. Therefore, it would be interesting to do a dedicated study 
to hnd generic SUSY models degenerate with SPSla. The most important outcome of such 
a study would probably be a systematic method for mapping out degeneracies. 

Another way to understand degeneracies is to try to determine the shapes of the small 
islands in parameter space. Though we found in our study that the size of regions in the 
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parameter space corresponding to a signature bin were small, if we had more details about 
the shape of the small islands in each direction of parameter space, it would help us in 
mapping out the set of solutions matching LHC data. One such study could be based on 
parameter excursions around a generic set of MSSM models and one could measure the rate 
that signatures change in different parameter directions. 

Measuring the difference between left-handed and right-handed squarks turns out to be 
quite difficult. We expect the overall rate will fix roughly some overall scale of the squarks. 
Although the structure in left-right squark degeneracies is much less prominent than the case 
of electroweak-inos, such a splitting is not expected to be a flat direction either. Therefore, 
details of a left-right squark degeneracy remain one of the important directions that need 
further exploration. 

Of course, the statistical method we have developed in this study is completely general 
and applicable to any class of models. It is important to repeat the same analysis not only 
within a class of models for new physics but also between such classes. In particular, it will 
be interesting to observe the inverse maps of models with similar gauge quantum numbers to 
SUSY, such as universal extra dimensions or the little Higgs with T-parity. Understanding 
robust distinctions between those models and supersymmetry is one of the most important 
questions in interpreting LHC data. 


9 After a Discovery at the LHC 

If new physics is discovered at the LHC with a pattern of signals roughly consistent with 
SUSY, the most pressing challenge will be to invert the LHC signatures and extract infor¬ 
mation about the soft supersymmetry breaking parameters. 

An obvious first step is hnding any model that hts the set of LHC signatures that 
have been measured, presumably starting with a reduced set of SUSY parameters. Our 
study shows that the success in hnding one model which hts the data is not the end of the 
process, though. In generic regions of SUSY parameter space, if there are not signihcant 
additions to the list of signatures we have used, there will be diherent models which produce 
indistinguishable signatures. The number of degeneracies is not intractably large, suggesting 
that degenerate models may be distinguishable with more detailed study. 

Therefore, the most important task after hnding one model that hts the LHC data is to 
create a catalog of all models which have signatures consistent with observation. The reason 
is that any detailed study would presumably require the construction of special observables 
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designed to distinguish specific models. Due to the small size of the islands in the parameter 
space occupied by degenerate models, and the large volume in which those islands are scat¬ 
tered, mapping out degeneracies is challenging. The naive approach of carpeting parameter 
space will not work in this case because of the large amount of necessary computation. 

Clearly, new methods and insights need to be developed for the purpose of identifying 
potential degeneracies. However, our characterization of the possible degeneracies as flippers, 
sliders, and squeezers is a useful first step in finding other candidate models. Then again, even 
though there are simple characterizations of the degeneracies in the electroweak-ino sector, 
we saw that other soft parameter had to shift to accommodate such changes, complicating the 
task of manufacturing degeneracies. There have been some attempts to match parameters 
of supersymmetry breaking with LHC data 1211 EH ES EH, and it would be interesting to 
see whether these approaches can be modified for the purpose of finding degeneracies. 


10 Discussion and Outlook 

In two years, the LHC will begin to answer questions that have driven much of the theoretical 
activity in our field for the past three decades. Thinking about how we will go from LHC 
data to the underlying model now will help us get to the physics we really care about 
as quickly as possible, and may be able to improve experimental settings and procedure. 
In this paper, we have initiated a systematic study of the LHC inverse problem, within the 
context of the supersymmetric standard model with minimal particle content but (relatively) 
unconstrained parameter space. We have used simple statistical techniques to probe this 
map, in particular studying the average number of models with equivalent LHC signals, and 
the effective dimensionality of the signature space populated by SUSY. 

In the regions of parameter space where the sleptons are not produced in long cascade 
decay chains, we find that there is very little handle on the slepton masses, and degeneracies 
in the electroweak-ino sector. The typical number of degeneracies for any given model is of 
order (d) ~ 10 — 100. We have shown that these degeneracies have simple interpretations. 
For instance, in the decay of X 2 fo Xi: we can have “flippers” where {x 2 y Xi) cire either {B, W) 
or (lU, H), with accompanying changes in the rest of the spectrum to match LHC signatures. 
With sleptons produced on-shell in cascade decays as is the case in many of the well-studied 
mSUGRA models, but perhaps not in nature, the situation is better, but there may still be 
possible degeneracies involving left/right swaps. 

Our study of the inverse problem reconciles two orthogonal views one often hears about 
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what the LHC can determine about weak-scale physics. There is the school of thought that 
says that the LHC is only a discovery machine, but that any more precise determination of 
the underlying physics must await the construction of a linear collider. Another school of 
thought holds that not only can the LHC discover new physics, it can also determine model 
parameters to few percent accuracy! Our picture of the inverse map shows the sense in 
which both of these pictures can be correct. There can indeed be a relatively large number 
of different models compatible with LHC data, partially justifying the hrst view, but each of 
these is a small island in parameter space, partially justifying the second view. Our result 
for the number of degeneracies ~ 1 — 100 is as interesting as it could have been, however. 
The number is not 10®; it is just large enough to represent a non-trivial challenge and just 
small enough to spur us to think of clever new signatures to resolve the small number of 
ambiguities in SUSY. And it is easy to determine whether a new set of signatures is effective 
in enlarging signature space—we simply add the signature and compute the new average 
number of degeneracies {d). 

Obviously, the number of degeneracies will be smaller if a more restrictive parameter 
space is chosen. Again, this can be looked at by repeating our analysis for the restricted 
models and computing (d). If (d) ~ 1, then if such a simple model reproduces LHC data, it 
is not likely to have a degenerate pair within its own model space, and despite the fact that 
it may have ~ 1 — 100 degenerate pairs in an enlarged parameter space, it would clearly be 
preferred over other generic points. It is quite interesting that tan/? is difficult to measure 
in general, and it is challenging to hnd signatures that are sensitive to tan (3 alone. Perhaps 
when information about the superpartner spectrum is known from the LHC, tan (3 could be 
determined with other information that is very sensitive to tan (3, such as the higgs sector, 
Q/i — 2 and Bs constraints. 

We have illustrated our approach to the inverse problem in the context of low-energy 
SUSY, but the same ideas can be applied to any theory of physics beyond the standard 
model, including theories of extra dimensions with KK parity and of little Higgs models 
with T-parity, which have very similar signatures to SUSY. Indeed, SUSY is likely a more 
challenging example due to its large parameter space. It would be interesting to study the 
inverse map in other models and between other models, study how the model footprints 
differ in signature space, and test ways of distinguishing qualitatively different possibilities 
for new physics at the LHC. Furthermore, our entire discussion has been at the electroweak 
scale in four dimensions. Once the 4D effective lagrangian is determined at the weak scale, 
we can begin probing the underlying higher scale or higher dimensional physics. 
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A N-tuples and Demonic Bins 


In section 12. Jl we derived a value for N, the number of experimentally distinguishable out¬ 
comes at the LHG, making the assumption that each signature bin is equally likely to be 
occupied. We can improve on this very simple picture. For instance, it could be that some 
fraction of bins are “demonic”, in the sense that a disproportionate number of balls land 
into them. We can model this by saying that a fraction x of balls land into a fraction A of 
the bins, while the remaining fraction (1 — x) of the balls land into the (1 — A) fraction of 
bins. The number of expected p-tuples of balls in the same bin is then 
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( 19 ) 


We can fit the observed Np to determine N,x,X. 


In our sample of with m = 43026, we have 283 doubles, 32 triples, and 3 quadruples, 
which is consistent with 


N = 3.19 X 10^ a: = 0.0317, A = 0.000114, (20) 

tellling us that 3.2% of models map to 0.01% of signature space. However, with such low 
statistics it is difficult to say whether this picture is correct, because we are fitting three 
numbers with three parameters. To really check the picture, we can artihcially increase the 
statistics by going to a larger signature cut to dehne degenerate models. If we increase the 
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Doubles 


Triples 




m m 



Figure 30: The fits of Np to equation (uni). The m axis indicates the number of models 
simulated, and in order to make smooth curves, 1000 random choices for the m models were 
averaged for each data point. The number of p-tuples is consistent with the picture that 
4.7% of models map to 0.11% of signature space, indicating that while 5% of models are 
much more likely to have degeneracies than generic models, the phenomena of degeneracies 
is not completely dominated by small regions in parameter space. 


number of doubles by a factor of 20, this gives us a sizable number of p-tuples, and we can 
now fit the Np curves as a function of m. Because we have 43,026 models to work with, we 
can average over many different subsets containing m models to get a smooth curve for Np 
as a function of m. 

The curves shown in figure 1201 are for 

N = 4.65 X 10^ X = 0.0473, A = 0.00113, (21) 

which tells us that 4.7% of the models map to 0.11% of the signature space. Because we 
have artificially increased the number of doubles by a factor of 20, the real value of N should 
be 20 times bigger than the value in equation m, giving us 

N = 9.3x 10®, (22) 

which is roughly compatible with equation (PU|). 
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B Complete List of LHC Observables 


In this section, we summarize the LHC signatures we have included in our study. The signa¬ 
tures can be divided into two categories: counting signatures and kinematical distributions. 
Both types of signatures depend crucially on the cuts and triggers we impose to isolate a 
signal sample. 


B.l Cuts and Triggers 


We identify our signal sample through a set of event hltering triggers and selection cuts. 
To account for the effect of multiple interactions and initial state radiation (which yield 
additional soft objects in events), we only keep objects in the event record if 

photon, leptons: Pt > 20, |? 7 | < 2, jets: Pt > 50, |? 7 | < 3. (23) 


We then select events with two or more jets subject to the criteria that 


]^j' > 150 GeV, 


Ht > 


600 GeV 0 or 1 lepton in event 
400 GeV 2 or more leptons in event ’ 


( 24 ) 


where 

Ht=^t+Y.Pt- (25) 

all jets 

These cuts were not optimized to maximally enhance signal over standard model background. 
However, the number of standard model events that pass these cuts are reasonable—a 10 
fb“^ ti sample contains around 76000 events that satisfy the above criteria out of 4.9 x 10® 
events total. 


B.2 Counting Signatures 

Gounting signatures record the number of a specihc kind of events based solely on their 
object content. The most inclusive counting signature is the total number of events that 
passes the trigger and cuts. All other counting signatures are represented as ratios to the 
total number of events or as ratios to each other. 

The only distinction between jets is whether they are &-tagged.’^ Therefore, we count 
jet multiplicity in two categories, 6-jet and non-6-jet. On the other hand, the leptons carry 
both charge and flavor information, so we keep separate counts for each charge and flavor 
^Strictly speaking, the 6-tag algorithm in PGS is a heavy-flavor tag and thus includes some c-quark jets. 
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configuration. Supersymmetric events could have multiple photons associated with them 
both from initial and hnal state radiation, as well as decay products. Therefore, a typical 
classihcation of a counting signature could be represented as 


jets( 7 ^non- 6 -jet, 7 ^ 6 -jet) + leptons(#, charge, flavor) + # 7 + ^t- 


( 26 ) 


Of course, we could form more inclusive observables by combining several categories together. 
For example, we could have an inclusive signature for the number of events with 1 lepton 
(including all possible charges and flavors) and 2 non- 6 -jets. 

Now, we list the counting signatures that we have implemented. Where appropriate, we 
use the following shorthand to dehne ratios. 


Y Number of events with X and Y 
X Number of events with X 


(27) 


1. Total number of events. 

The sum of all events that passed the selection cuts. 

2. Jet and lepton counts. 

Events are categorized according to the number of leptonic objects in the event. All 
counts here are charge and flavor inclusive. For each lepton number category, the 
counts are further separated according to number of jets and 6 -jets in the event. The 
signatures are: 

He leptons Uj jets Ub 6 -jets 


rij jets 


Ub 6 -jets 


total events’ leptons’ leptons’ 


where 


= 0,1, 2,3,4+, nj = 2, 3,4, 5, 6,7,8+, n?, = 0,1, 2, 3,4, 5, 6 +. (29) 

Note that Uj jets includes both 6 -jets and non- 6 -jets. 

3. Lepton flavor and charge counts. 

For events with leptons, flavor information is used to further divide signatures into 
sub-categories. We have included here only the overall counting of the different fla¬ 
vor/charge categories without regard for jet multiplicity. In all cases, the signature 
used is 

(30) 


flavor category 
ne leptons ’ 


where the flavor categories are: 
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• 6 possibilities if {i = e,/i, r). 

• 2L 21 combinations iflf- 

• 2>l. For events with 2 same flavor opposite charge leptons lfl~lf, we categorize 
them according to the flavor of It and the flavor/charge of if. (18 combinations.) 
For trilepton events without same flavor opposite charge pairs, we give the total 
charge (1±,3±) and the dominant flavor (e,p,r). (12 combinations.) When the 
dominant flavor is ambiguous, we assign the dominant flavor to be r. 

• A'^l. We do not include flavor/charge information in this category. 


4. Photon counts. 


?7,.y photons 
total events ’ 


n.y = 0 , 1 , 2 +. 


(31) 


B.3 Kinematical Distributions 

Generally, separate distributions for certain kinematic variables can be implemented accord¬ 
ing to the object content of the event, as in equation For a lot of inclusive distributions 
we have used in this analysis, we use the number of leptons, jets, and 6-jets to label the 
distribution. 

The kinematical distributions are divided into two broad categories: Pt (those based on 
transverse momentum sums) and ruinv (those based on Lorentz-invariant four-vector sums). 


B.3.1 Transverse Momenta: Pt 

In this category, all kinematical observables has the form 

i: (32) 

{a} 

where the sum is over a set of specific objects of interest. Distributions are always binned 
into deciles, and the boundaries of the deciles (excluding the overall lower and upper bounds) 
are the signatures we store. 

The most commonly used observable in this category is effective mass, Meg EH 123 EH ESI- 
Instead of defining Meg only for the jet signatures, we define such an observable for any large 
class of event objects, 

MeS =^T + X! 

all objects 
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Notice that instead of summing the Pt of just the jets, we sum the Pt for all objects in the 
events. We separate into 12 different distributions labeled by the number of jets and 
leptons in the event: 


Hj = 2, 3,4, S’*", = 0,1, 2~^. 


(34) 


In particular, only includes events with exactly rij jets and leptons. 

We have also implemented inclusive distributions of jets, leptons, photons, and 

• Inclusive Pt distribution of the n-th hardest jet, where n = 1,2, 3,4. Distri¬ 
butions are categorized according to number of leptons (n£ = 0,1,2’'') but include all 
events with at least n jets. 

• Pt for all events with leptons. Separate distributions are used for different lepton 
flavor, but there is no charge separation, nor separation by the number of jets in the 
event. 

• Pt for all events with photons. 

• ^T for all events that pass our trigger. Separate distributions are used for different 
number of leptons {jii = 0,1, 2+). 

B.3.2 Invariant Mass: ruinv 

In this category, all kinematical observables have the form 



(36) 


Invariant mass distribution typically carry important information about the masses of the 
massive particles in the decay chain. They have been used extensively in the past in studies 
for measuring soft parameters [SnUSHESlEII. We have implemented in our study a collection 
of invariant mass distributions which, if well populated, are sensitive to all important masses 
of superpartners. Unless otherwise indicated, invariant mass distributions are binned into 
deciles. 

1. All objects. 

Invariant mass for all the objects in the event are used in there distributions. Separate 
distributions are created according to number of lepton and jets in the event: 


Hj = 2, 3,4, S’*", n£ = 0,1, 2~^. 


(36) 
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2. Jet objects. 

Invariant masses of various combinations of jets are used, if such a combination is 
possible in the event. Events are further divided into separate distributions based on 
the number of leptons in the event = 0,1, 2’''). We use the following shorthand 
notation in events with many jets: h refers to the hardest jet regardless of flavor, n 
refers to the hardest non-6-jet; b refers to the hardest jet with a 6-tag. In all but 
the di-6-jet invariant mass distribution, these distributions are inclusive of the total 
number of jets; for example, the invariant mass distribution for the two hardest jets 
includes all events with at least two hard jets. 

• mhh- Two hardest jets. 

• fnhhh- Three hardest jets. 

• 'n^hhhh- Four hardest jets. 

• rrinn- Hardest pair of non-6-jets. 

• rribn- Hardest 6-jet paired with hardest non-6-jet. 

• 'rubun- Hardest 6-jet with two hardest non-6-jets. 

• rribb- Pairs of 6-jets in events with exactly 2 6-jets, binned in 20-tiles. 

• uibbri- Two hardest 6-jets with hardest non-6-jet. 

• ^bbnn- Two hardest 6-jets with 2 hardest non-6-jets. 

3. Events with one lepton. 

For events with a single lepton, we construct invariant mass distributions involving the 
hardest jets {h) and hardest 6-tagged jets (6). All distributions are separated by lepton 
flavor but not by charge, and placed into 20-tiles. 

• m£^h^, 2 . Separate distributions for combining the single lepton with the hrst or 
second hardest jet. 

• miy, 2 . Separate distributions for combining a single lepton with the hrst or second 
hardest 6-jet. 

4. Events with same-havor opposite-sign dileptons. 

For events with dileptons, we have included various invariant mass distributions, many 
of them similar to those studied in |341 IHK] . The existence of dileptons indicates a long 
decay chain with at least two jets from earlier stage of the same decay chain. Various 
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pairing of the leptons and jets are sensitive to a variety of combinations of the masses 
of the superpartners involved—squarks, electroweak-inos, and possibly the sleptons. 

All dilepton invariant mass distributions refer to events with same-flavor opposite-sign 
dileptons and no other leptons, with separate distributions for different dilepton flavors. 
All distributions are binned in 20-tiles excepted where noted. 

• mu. Dilepton invariant mass distribution. Dilepton pairs whose invariant mass 
falls in the range ±5 GeV of mz are subtracted and histogramed in a separate 
distribution. The ratio of the number of events in the “Z-window” to the total 
number of dileptons is used as an additional signature. 

• mu^i,. Pair of leptons with the hardest &-jet in events with at least 1 6-jet. 

• ^ei,hh- of leptons with the two hardest jets. 

• /jhigh.iow. Combination of hrst or second hardest jet with the pair of leptons. 
High (low) are dehned by the combination with the larger (smaller) invariant 
mass. The jet associated with the high (low) combination is labeled as high (low) 
for the distribution below. 

• Four combinations of combining the high or low jet with 
each lepton, choosing the lepton that gives rise to the larger (m*^'®*^) or smaller 
(m*°") invariant mass. 

The following histogram are for events with exactly 2 6-jets and dileptons. Because 
these histograms are generically less populated then the ones above, they are binned 
only in deciles. 

• 'fTT-bblee 66 invariant mass distribution in dilepton events. 

• rubbu- Pair of leptons with both 6s. 

• 6-6-lepton invariant mass, where high (low) refers to the lepton that 
gives the largest (smallest) invariant mass. 

• fehigh.iow. Combination of hrst or second hardest 6-jet with the pair of leptons. 
High (low) are dehned by the combination with larger (smaller) invariant mass. 
The jet associated with the high (low) combination are labeled as high (low) for 
the distribution below. 

• Four combinations of combining the high or low 6-jet with 

each lepton, choosing the lepton that gives rise to the larger or smaller 

(m*°") invariant mass 


51 



5. Di-photon invariant mass distribution, in deciles. 


C Technique for Assigning Statistical Error Bars 


To assign error bars to our signatures, we use Gaussian statistics supplemented with 0{1) 
scale factors. We ignore systematic errors and errors from standard model background 
subtraction. Using a detector simulator, the best way to determine statistical errors is 
to run a model many times at fixed integrated luminosity to figure out the appropriate 
conhdence intervals on a given signature. This is very computationally expensive, so we will 
use a simple method to mock up reasonable error bars. 

For every signature we first define a raw statistical error bar based on Gaussian statistics. 
For an integer-valued counting signature of value n, 

(Jraw^ = VnTl. (37) 


For a ratio-valued counting signature of value n/m, where n and m are pseudo-independent 
integer-valued counts. 


/n\_ / \/n + 1V / n^/m + 1V 

\m) \\m + lj 


(38) 


For a histogram with n elements and a quantile boundary with value q, we dehne giow (o'high) 
as the new value of the quantile boundary if ^/n elements were added to the lowest (highest) 
bin of the histogram. The raw statistical error on q is dehned as 




(g - qiowY + (g - ghigh)^ 


(39) 


While these raw statistical error bars scale like Gaussian statistics, there is no guarantee 
that Gaussian statistics is a good description of the errors from PYTHIA piped through 
PGS. Therefore, for the i-th signature, we dehne the statistical huctuation as 




(40) 


where ai is an 0{1) phenomenological parameter that is different for each signature. To 
determine the value of a, we ran a subset of our models again with a different random 
number seed. Then for every signature, we calculate 


a}^ = 


raw Si 


n2 


+ (<5ri 


. 242 ’ 


(41) 
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where and sj are the signature values of the same model with different random number 
seeds. The a, we use in subsequent calculations is the 95th percentile of taken over all 
models or 1, whichever is largest. The selected values never exceeded 2.2, telling us that 
Gaussian statistics are a good approximation to statistical error bars over a wide range of 
models and signatures. 
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